Stylistic Changes for Temporal Text Classification
Authors
Abstract
This paper investigates stylistic changes in a set of Portuguese historical texts ranging from the 17th to the early 20th century and presents a supervised method to classify them by century. Four stylistic features, namely average sentence length (ASL), average word length (AWL), lexical density (LD), and lexical richness (LR), were automatically extracted for each sub-corpus. The initial analysis of diachronic changes in these four features revealed that texts written in the 17th and 18th centuries have similar AWL, LD, and LR values, which differ significantly from those of texts written in the 19th and 20th centuries. This information was then used for automatic classification of texts by century, yielding an F-measure of 0.92.
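As a rough illustration of the four features named in the abstract, the sketch below computes them for a single text. The abstract does not give the exact definitions, so the ones used here are assumptions: LD is taken as unique word forms over total tokens, and LR as unique lemmas over total lemmas. The function name `stylistic_features` and the `lemmatize` parameter are hypothetical, and the regex-based sentence and word splitting is a simplification, not the authors' pipeline.

```python
import re


def stylistic_features(text, lemmatize=None):
    """Compute ASL, AWL, LD and LR for one text (definitions are assumed)."""
    if lemmatize is None:
        # Placeholder only: lowercasing is NOT real lemmatization, so LR
        # equals LD unless the caller supplies a proper lemmatizer.
        lemmatize = str.lower

    # Naive sentence split on runs of ., ! or ? (assumption).
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    # Words are runs of Unicode letters, so accented characters are kept.
    words = re.findall(r"[^\W\d_]+", text)
    if not sentences or not words:
        raise ValueError("text must contain at least one word")

    lemmas = [lemmatize(w) for w in words]
    return {
        # Average sentence length, in words per sentence.
        "ASL": len(words) / len(sentences),
        # Average word length, in characters per word.
        "AWL": sum(len(w) for w in words) / len(words),
        # Lexical density: unique lowercased word forms / total tokens.
        "LD": len({w.lower() for w in words}) / len(words),
        # Lexical richness: unique lemmas / total lemmas.
        "LR": len(set(lemmas)) / len(lemmas),
    }
```

Computed per sub-corpus, values like these could serve as a four-dimensional feature vector for a century classifier; the abstract does not name the classifier, so none is shown here.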
Similar resources
Mohammad-Karim Pirnia and Donald Wilber: Differences in the Purposes and Elements of Stylistic Narrative
Although the discipline of Iranian architectural history has its roots in the recent century, its theories and discourses are still under debate. Stylistic analysis is one of the major tools in art and architectural historiography. In this paper, we discuss differences in the purposes behind the stylistic analyses provided by two major historians of Iranian architecture, namely Mohammad-Karim P...
Style of Religious Texts in 20th Century
In this study, we present the results of the investigation of diachronic stylistic changes in 20th century religious texts in two major English language varieties – British and American. We examined a total of 146 stylistic features, divided into three main feature sets: (average sentence length, Automated readability index, lexical density and lexical richness), part-of-speech frequencies and ...
Examination of Authors' Stylistic Elements of Electronic Messages based on Researched Studies
Author identification is an important issue in natural language processing and text classification. It reveals an author's characteristics across various texts. With the rapid development of the Internet, Web-based tools that allow anonymous identities, such as email and blogs, have become a popular means of communication for perpetrators. Moreover, this creates some specific security issues. In this paper, we ...
Stylistic Analysis of a Poetic Text: A Case from Persian
Poetic analysis involves the explication of a poem by focusing on the process of semiosis in it. Through semiosis linguistic meaning is transformed into stylistic meaning. An examination of semiosis brings us to look at the hypersemanticized poetic structures which are none other than the style features of a poem. Since style functions in a literary text by conveying meanings other than literal...
Systemic Functional Features in Stylistic Text Classification
We propose that textual ‘style’ should be best defined as ‘non-denotational meaning’, i.e., those aspects of a text’s meaning that are mostly independent of what the text refers to in the world. To make this more concrete, we describe a linguistically well-motivated framework for computational stylistic analysis based on Systemic Functional Linguistics. This theory views a text as a realisation...